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ABSTRACT 


A  simple  method  is  developed  for  approximating  confidence  intervals  for  a 
linear  change-point  model  representing  residential  energy  consumption.  The 
method  can  be  applied  to  any  nonlinear  model  which  is  conditionally  linear  in 
all  but  one  parameter.  The  approximation  is  shown  to  be  more  accurate  than 
the  Gaussian  approximation,  and  easier  to  compute  than  confidence  intervals 
defined  by  likelihood  (sum-of -squares)  regions.  Computations  are  described  in 
some  detail  for  the  motivating  model.  ^ 
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SIGNIFICANCE  AND  EXPLANATION 


A  method  is  developed  for  calculating  approximate  confidence  intervals 
for  certain  types  of  regression  models.  The  confidence  intervals  indicate  the 
accuracy  of  the  parameter  estimates.  The  method  is  compared  with  two  standard 
procedures  currently  in  common  use.  The  method  proposed  is  easier  to  compute 
than  the  more  complicated  procedure,  but  more  accurate  than  the  simpler  one. 
The  method  is  motivated  and  demonstrated  by  a  simple  model  of  residential 
energy  consumption. 
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1 .  Introduction 

Two  methods  are  commonly  used  to  obtain  approximate  confidence  intervals 
for  parameters  of  nonlinear  models  fit  by  the  method  of  least  squares.  First 
is  the  Gaussian  approximation,  based  on  the  asymptotic  normality  of  the  least 
squares  estimate.  The  second  is  the  likelihood  or  a um-of -squares  method, 
based  on  the  asymptotic  chi-squared  distribution  of  the  likelihood  ratio. 

Asymptotically,  for  infinite  sample  sixes,  the  two  methods  are 
equivalent.  In  small  samples,  however,  the  sum-of-squares  method  is  generally 
more  accurate,  in  the  sense  of  providing  regions  with  coverage  probabilities 
closer  to  their  nominal  confidence  levels.  The  Gaussian  approximation,  on  the 
other  hand,  based  on  an  implicit  linearisation  of  the  model  function,  is 
usually  much  easier  to  compute. 

A  simple  procedure  developed  in  this  paper  produces  approximate  sum-of- 
squares  regions.  These  regions  are  in  general  more  accurate  than  those  based 
on  the  Gaussian  approximation ,  but  easier  to  compute  than  the  actual  sum-of- 
squares  regions.  An  additional  advantage  of  the  method  described  below  is 
that  it  does  not  require  computation  of  derivatives  of  the  model  function. 
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N*  express  th*  general  nonlinear  regression  equation  in  matrix  form,  as 

y  -  n( 0)  +  e 

where  n  is  an  n-dimensional  vector  function  of  the  p- dimensional  parameter 
9,  and  e  is  a  random  disturbance,  also  of  dimension  n.  We  assume  the 
components  e  to  be  independent,  following  a  Normal  distribution  with  mean 
0  and  variance  a2. 

Attention  is  restricted  to  model  functions  n  which  are  conditionally 
linear  in  all  but  one  parameter.  That  is,  we  assume  the  unknown  parameter  8 
can  be  partitioned  as 

8  -  l~]  . 

with  dim(T)  -  1,  such  that 


n(6)  -  C(r)X  ,  (1.1) 

for  €  an  n  x  (p-1)  matrix  function  of  t.  We  further  confine  attention  to 
the  problem  of  obtaining  confidence  intervals  for  individual  components  8^ 
(i.e.,  for  t  and  for  X^,  j  ■  1, 2, . . . ,p-1 ) . 

Our  method  is  motivated  and  illustrated  by  application  to  a  simple  model 
of  residential  energy  use.  The  energy  model  is  described  in  Section  2,  and 
the  procedures  used  to  fit  the  model  in  Section  3.  In  Section  4,  methods  of 
obtaining  confidence  intervals  for  the  model's  parameters  are  discussed  in 
relation  to  the  fitting  procedure  used.  A  simplified  method  of  obtaining 
approximate  confidence  intervals  for  nonlinear  models  is  then  introduced  in 
Section  5,  and  is  applied  to  the  energy  model  in  Section  6.  We  conclude,  in 
Section  7,  with  a  discussion  of  the  advantages  of  this  method  for  general 


models  of  the  form  (1.1) 


2.  The  Energy  Model 


For  the  primary  heating  fuel  in  a  house,  a  simple  model  assumes  that 

consumption  on  a  particular  day  is  given  by  the  baseload  a  if  the  day's 

average  outdoor  temperature  T  is  above  a  reference  temperature  t,  and 

increases  in  proportion  to  t  -  T  if  t<t.  For  time  period  m,  let  Yn 

denote  the  average  daily  fuel  consumption,  Nm  the  number  of  days,  and  1^ 

the  average  number  of  degree-days  per  day,  to  base  t: 

N 

H  <T)  -  —  l  (T-T.)I(T  ,<T)  ,  (2.1) 

B  Nm  j-1  3  3 

where  I  is  the  indicator  function.  The  energy  model  is  then  expressed 
formally  as 

E(Y_)  -  n_(o,$,T) 
m  in 

-  a  +  BH  (t)  .  (2.2) 

n 

In  the  terms  of  Equation  (1.1),  we  have  X'  “  [a, 6),  where  the  apostrophe 
denotes  the  transpose,  and  £(t)  *  (1,H(T)] ,  where  1  denotes  an  n-vector 
of  ones. 

The  daily  temperature  data  TB j  are  obtained  in  integer  degrees 
Fahrenheit  from  a  nearby  U.S.  Heather  Bureau  station  (National  Oceanic  and 
Atmospheric  Administration,  monthly).  The  consumption  data  are  derived  from 
the  house's  monthly  utility  bills. 

The  model  defined  by  Equation  (2.2)  is  also  used  to  describe  aggregate 
data,  representing  a  utility  service  region  or  a  whole  state.  In  this  case, 
the  observations  Y_  are  average  daily  consumption  per  household  for  each 
month  m.  In  addition,  the  degree-day  variable  Hm  is  modified  to  account 
for  the  lag  introduced  by  meters'  being  read  on  different  days  throughout  the 
month.  The  variable  used  for  aggregate  analysis  in  this  work  is  defined 


H  (t) 


M»  **«-1 

m±j - - - a-1- -  .  (2.3) 


N  N  4 

n  m-1 

l  W  +1-J)  ♦  I  j 

j-1  a  j-1 


For  both  aggregate  and  individual-house  analyses,  a  major  use  of  the 
energy  model  is  to  calculate  normalised  annual  consumption  T.  the  index  F 
is  given  by 

r  -  365(a  +  0Ho(T)>  ,  (2.4) 

where  hq(t)  is  the  long-run  (say,  10 -year)  average  of  daily  degree-days  base 
t.  That  is,  for  both  single  and  aggregate  analyses,  Hq  is  computed  from 
Equation  (2.1),  with  period  0  indicating  a  base  period  of  several  years* 

The  index  F  represents  the  amount  of  fuel  the  house  (or  the  average 
household)  would  consume  in  a  year  of  typical  temperature  conditions. 

Comparing  values  of  F  obtained  from  two  successive  years  of  data  thus  offers 
a  measure  of  energy  conserved  (or  wasted)  idiich  is  unbiased  by  the  severity  or 
mildness  of  a  particular  winter. 

The  theoretical  basis  for  the  model  defined  by  Equation  (2.2)  is 
discussed  by  Fels  and  Goldberg  (1982),  who  present  analyses  of  aggregate 
residential  gas  consumption,  based  on  the  index  F.  Equation  (2.2)  and  the 
normalised  annual  consumption  F  have  also  been  used  to  analyse  a  large 
number  of  single  houses  heated  by  gas  (Dutt  et  al,  1982),  and  by  oil  (Fels 
et  al,  1981).  Extensions  of  the  method  to  electricity  use  in  single  houses 


are  under  way 


3.  Fitting  the  Energy  Modal 

Two  basic  procedural  have  baan  used  to  fit  Equation  (2.2).  One  is  a  grid 
search,  the  other  an  algorithm  based  m  Newton's  method.  Since  the  model 
function  n  is  linear  in  a  and  6  for  fixed  t,  both  procedures  start 

A 

with  a  search  for  the  "best  t."  Once  the  least  squares  estimate  t  is 

*  A 

found,  the  estimates  a  and  0  are  found  by  ordinary  linear  regression  of 

A 

Y  on  H(T). 

The  first  fitting  procedure,  which  is  easier  to  implement,  is  a  simple 
grid  search  taking  integer  (°F)  values  of  T  as  grid  points.  The  value  of 
t  for  which  T  and  H(t)  have  the  highest  correlation  is  the  best  (integer) 

A  A 

t,  and  the  corresponding  estimates  a  and  0  the  best  values  of  a  and 
0  for  that  T.  Of  course,  once  the  best  integer  T  is  found,  a  more  precise 
search  can  be  made  in  a  restricted  neighborhood  of  that  T,  either  by  taking 
a  finer  grid  or  by  using  steps  from  the  second  procedure,  described  below. 

The  grid  search  produces  the  likelihood  function  over  the  entire  range  of 
possible  reference  temperatures  T.  This  information  may  be  of  interest  in  a 
detailed  analysis,  but  for  routine  applications  the  grid  search  can  be  too 
cumbersome s  the  variable  H(t)  and  its  correlation  with  T  must  be 
evaluated  ordinarily  for  values  of  t  from  55°F  to  72°F,  and  in  some 
cases  for  a  much  wider  range. 

The  second  procedure,  based  on  Newton's  method,  is  more  efficient,  but 
somewhat  more  complicated,  requiring  the  derivatives  with  respect  to  a,  0 
and  t  of  the  model  function  n  given  by  Equation  (2.2).  For  this  model. 


the  derivatives  are  given  by 


3 [a,0,T] 


(1,  h(t)  ,  0  ^5^)  . 


The  model  function  n  is  classed  as  nonlinear  because  the  derivative  n 


depends  on  the  model  parameters. 


-5- 


In  fact,  the  energy  model  Is  not  only  nonlinear,  but  is  nondifferentiable 
as  walls  the  derivative  dH/dt  is  discontinuous  at  temperature 
observations  .  However,  except  at  the  points  t  ■  (which  occur  at 

integers)  dH/dT  is  well-defined.  Therefore,  arbitrarily  choosing  a  right- 
continuous  function,  we  define 


dH(T) 

m 

dT 


5  F  (T) 

n 


where,  for  a  single  house 

F  (T) 
in 

and  for  the  aggregate 


! 

N 


jE,  KT.^x) 


(3.1) 


F  (T) 
m 


N 

m 

l 

izl 


(N+1- j  )I  (T  <T ) 
n  nj 


l  (H+1- j )  +  l  j 
j-1  j-1 


(3.2) 


Of  course,  Fm  is  a  step  function,  representing  the  empirical  distribution  of 
the  observed  temperatures  Tm^. 

With  the  second,  derivative-based  procedure,  the  least  squares  estimate 

A 

T  is  found  by  solving  the  normal  equation 

{SFY  -  SHY  SHF/SHH}|t  ■  0  ,  (3.3) 

where  S..  denotes  a  sum  of  squares  or  of  cross-products,  corrected  for  the 

mean.  Using  Newton' s  method,  the  solution  is  found  iteratively  from  an 
(0 ) 

initial  value  T  ,  by  setting 


(i+1)  (i)  .SFY  SHH  -  SHY  SHF. ■ 

sFY  SHF  -  SHY  SFF'f^i)  *  '  ' 

Since  the  model  function  n  is  linear  between  integers  t,  exact 
convergence  (within  machine  accuracy)  occurs  in  one  step  after  t^  enters 

A 

the  integer  interval  containing  the  least  squares  estimate  t.  This  statement 
must  be  qualified,  however,  by  two  caveats. 
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We  are  seeking  the  minimum  of  the  function  RSS (t ) ,  given  by 
RSS(T)  *  SYY  -  {SHY2/SHH}|t  . 

This  function  has  a  discontinuous  derivative  (a  kink)  wherever  H(t)  has  a 
discontinuous  derivative  -  that  is,  at  integer  values  of  T.  Hence  we  have 

A  A 

the  first  caveat:  if  r  itself  is  an  integer,  then  the  minimum,  RSS(t),  is 

A 

at  a  cusp,  not  at  a  point  where  dRSS/dT  -  0.  That  is,  T  will  not  satisfy 
the  normal  equation  (3.3)  in  this  case,  so  that  the  iteration  (3.4)  will  never 

A 

converge  to  T,  but  will  oscillate  indefinitely  on  either  side. 

A 

The  second  caveat  is  that,  even  when  t  does  satisfy  Equation  (3.3), 
there  is  no  guarantee  that  the  value  determined  from  Equation  (3.4) 

A 

will  ever  reach  the  interval  containing  t.  Instead,  the  iteration  may 
oscillate  over  two  or  more  intervals,  excluding  the  right  one,  or  may  converge 
to  a  local  minimum  which  is  not  the  global  minimum. 

The  possible  shapes  of  the  function  RSS(t),  and  the  corresponding 
behavior  of  the  fitting  procedure,  have  been  described  in  detail  in  Goldberg 
(1982).  The  detailed  fitting  algorithm,  which  compensates  for  these 
difficulties,  is  given  in  the  Appendix.  The  performance  of  this  procedure  in 
applications  to  a  large  number  of  gas-heated  houses  is  discussed  in  Stram  et 
al  (1982). 

4.  Calculating  Approximate  Confidence  Intervals 

A 

Whatever  method  is  used  to  find  the  least  squares  estimate  t,  estimates 
of  the  intercept  a  and  slope  6  are  found  by  ordinary  linear  regression  of 

A 

the  fuel  data  Y  on  degree-days  H(t).  However,  the  standard  errors  computed 
from  the  linear  regression  formulae  will  understate  the  uncertainties  in  the 

A  A 

estimates  a  and  6.  The  understatement  results  from  the  assumption, 

A 

implicit  in  these  formulae,  that  the  "correct"  regressor  H(t)  was  known  (or 
fixed)  in  advance,  rather  than  being  estimated  from  the  data. 


Appropriate  measures  of  error  can  be  obtained  for  the  estimates  of  our 


model  using  methods  developed  for  general  nonlinear  models.  In  particular,  we 
will  consider  confidence  intervals  first  based  on  the  Gaussian  approximation 

A 

to  the  distribution  of  0,  and  secondly  based  on  the  sum-of- squares  or 
likelihood  method.  In  the  discussion  below,  these  two  methods  are  denoted  by 
(G)  for  Gaussian  and  (S)  for  sum-of-squares. 

The  validity  of  applying  these  methods  to  the  energy  model  is  discussed 
in  earlier  work  (Goldberg,  1982  and  1983).  In  the  present  work,  we  attempt  to 
find  efficient  means  of  computing  accurate  confidence  intervals.  After 
summarizing  the  Gaussian  and  sum-of-squares  methods,  with  particular  reference 
to  models  of  the  type  considered  here,  we  will  introduce  two  closely  related 
approximations.  From  these  two,  we  will  then  derive  a  superior,  composite 
approximation,  in  Section  5.  Since  the  sum-of-squares  intervals  (S)  are  in 
general  much  more  accurate  than  the  Gaussian  (G),  the  former  (S)  are  taken  as 
the  standard  against  which  other  approximations  are  compared. 

4.1.  The  Gaussian  Approximation 

Confidence  intervals  based  on  the  Gaussian  approximation  (G)  are  easily 
obtained  for  a  general  nonlinear  model  fit  by  the  Gauss-Newton  method.  The 

A 

variance-covariance  matrix  of  the  parameter  estimate  6  is  estimated  by 

*  *  *2  •  •  -1 . 

V(9)  -  a  (h'n)  .  ,  (4.1) 

le 


with 


Aa  a 

o  -  RSS(0)/(n-p)  .  (4.2) 

The  standard  error  a ^  of  the  component  of  0  is  the  square  root  of  the 
th  A  A 

j  diagonal  element  of  V(0).  The  confidence  interval  for  0^  of 
approximate  confidence  level  1  -  *  (for  some  small  probability  ir)  is 
therefore  bounded  by 

0.  ±  st*72  (4.3) 

j  j  n-p 
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f 


-  - 


a 


«  /2 

where  t  is  the  1  -  r/2  quantile  of  the  t-distribution  on  n-p  degrees 
n-p 

of  freedon. 

For  the  energy  model,  when  the  reference  temperature  t  is  estimated  by 
the  procedure  based  on  Newton' s  method,  approximate  confidence  intervals  for 
a,  6,  and  r  can  be  obtained  from  Equations  (4.1)  and  (4.3).  For  the  fourth 
parameter  of  interest,  T,  approximate  variance  and  covariances  are  obtained 
using  the  linearisation 

T  -  7  +  d(H0<T)  -  H(t  ) )  +  (t-t)S(Fq(t)  -  F( T) )  ,  (4.4) 

where  X  denotes  the  sample  mean  of  X^.  For  both  single  houses  and 
aggregates,  the  normalised  derivative  FQ  is  defined  by  Equation  (3.1),  with 
"month”  0  defined  as  for  HQ. 

A  A 

With  variances  and  covariances  denoted  respectively  by  V  and  C, 
evaluating  the  variance-covariance  matrix  for  the  four  parameters  yields 


V(B)  - 

A  A 

V(T)  - 

AAA 

C(B,T) 

A  A 

V(O)  - 


SHH(1-r2) 

“2 

q _ 

2  2 
B  sff( 1-r  ) 

“2 

-  a  SHF 

”  2 

BSHH  SFF(1-r  ) 

*2 

—  +  V(B) (H)2  +  2C(B,T)H  F  +  V(T) (BF)2 
n 


(4.5) 


V(f>  +V(B)(AH)2  +  2C(B,t)(AH)B(AF)  +V(t)(BAF)2 

n 


C(r,T)  -  AH  C(B,T)  +  BAF  V ( T ) 


i 
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where  r2  -  SHF2/(SHH  SFF) ,  AH  -  HQ  -  H,  and  AF  »  FQ  -  F,  all  evaluated  at 

A 

T.  In  Section  6,  the  Gaussian  approximation  represented  by  Equations  (4.3)  to 
(4.5)  will  be  applied  to  particular  data  sets  for  the  energy  model.  The 

A 

covariances  C  given  in  Equations  (4.5)  are  the  only  two  that  will  be  needed 
here. 

A  A 

Note  that  the  formula  for  V(D  is  the  same  as  would  be  given  by 
Equation  (4.1),  if  the  model  function  had  been  defined  in  terms  of  I*,  3,  and 
t  as 

n*(r,$,T)  ■  r  +  3(h  (t)-h.(t))  . 

in  in  u 

This  agreement  is  to  be  expected,  since  both  Equation  (4.4)  and  Equation  (4.1) 
are  based  on  a  linearization  of  the  model  function  n» 

4. 2.  The  Sum-of-Squares  Method 

The  second  basic  method  we  will  consider  for  obtaining  approximate 
confidence  intervals  is  the  sum-of-squares  method  (S).  For  a  general  model  of 
the  form  of  Equation  (1.1),  if  the  "nonlinear"  parameter  T  is  estimated  by  a 
grid  search,  the  boundaries  of  a  sum-of-squares  region  are  easily  computed 


during  the  estimation. 

For  any  single  parameter  0  ^ ,  the  confidence  region  of  approximate 
confidence  level  1  -  ir  is  the  projection  onto  the  0^  axis  of  the  region 


bounded  by  a  surface  satisfying 

A  A 

RSS(A.T)  -  RSS(l.T) 

A  A 

RSS(A,x)/(n-p) 


1,n-p 


(4.6) 


where  F,  is  the  1  -  it  quantile  of  the  F-distribution  with  1  and 

l,n-p 

n  -  p  degrees  of  freedom.  Regions  determined  by  Equation  (4.6)  are  sometimes 
called  "likelihood  regions." 

A 

Let  C  *  C(r),  and  denote  by  A  the  least-squares  estimate  of  A 
T  T 

from  the  linear  regression  of  Y  on  5^.  Equation  (4.6)  can  then  be 


rewritten  as 
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A  Am  ^  A  A  A 

RSS(A,X)  -  RSS ( X  ,X)  -OF.  -  (RSS ( X  ,t)  -  RSS(X,T) ) 

x  n-p  t 

or 

(A-A  )'(£',e  MA-A  )  -  o2(F*  -  f ( T ) )  (4.7) 

T  T  T  T  if  n-p 

A  A  A  A  ^ 

where  f(x)  *  {RSS(At,x)  -  RSS(A,x))/a  is  the  F-statistic  for  testing  the 

hypothesis  that  t  is  the  true  value  of  that  parameter.  The  left-hand  side 

of  the  equation  results  from  the  fact  that  the  model  function  is  linear  in  X 

for  fixed  T.  For  any  t  within  the  confidence  region  (of  approximate 

confidence  level  1  -  w )  given  by 

f(x)  <  F*  ,  (4.8) 

1#  n-p 

the  right-hand  side  of  Equation  (4.7)  is  positive,  so  that  the  equation  has 
real  solutions  in  X.  We  will  assume  that  the  region  in  T  defined  by 
Equation  (4.8)  is  an  interval,  which  we  denote  by  [t  ,t  ). 

Equation  (4.7)  defines  the  cross  section  at  x  in  the  X  subspace,  for 


the  surface  given  by  Equation  (4.6).  Equation  (4.7)  is  familiar  from  the 

theory  of  linear  models,  with  the  difference  F*  -  f(T)  taking  the  place 

i ,  n-p 

of  an  F-value  by  itself.  For  any  component  A^,  then,  the  extreme  values 
A*^  and  A^  for  which  Equation  (4.7)  has  a  solution  can  be  found  using  the 


linear  theory  formula 

A*  -  X,  ±  {V  (A  )(F*  -  f ( T ) ) } 1 /2 

jT  jX  "  X  jT  1,n-p 

where  ( A ^ ^ )  is  the  j  diagonal  element  of 

A  A  AM  ^4 

vv • 


(4.9) 


(4.10) 


A  confidence  interval  f X j ,A j j  is  then  obtained  by  maximizing  (minimizing) 
the  upper  limit  A_jt  (lower  limit  A^),  over  t  e  [x  ,x  ]• 

A  * 

The  matrix  VT  of  Equation  (4.10)  represents  the  variance  of  At 

assuming  x  to  be  known.  Note  that  in  Equation  (4.10)  the  estimated  sample 
*2 

variance  o  is  still  given  by  Equation  (4.2).  Therefore,  if  a  standard 


linear  regression  is  used  to  compute  X^,  the  variance  of  X^  as  calculated 

A  A  A 

by  the  routine  mast  be  multiplied  by  RSS( X,x) (n-p+1 )/[ (RSS( A^ ,x) (n-p) ]  to 
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produce  the  values  (\^ ) .  when  t  ■  r,  the  multiplier  reduces  to  a 
correction  factor  (n-p+1 )/( n-p)  for  the  degrees  of  freedom. 

4.3.  Applications  to  the  Energy  Model 

To  motivate  the  shortcut  confidence  regions  to  be  considered  next,  we 
apply  the  computational  method  just  described  to  the  parameters  of  the  energy 
model.  The  known-T  variances  given  by  Equation  (4.10)  become 

A  A  Am 

vt($t)  -  a  /SHH 

VT(aT)  -  o2(1/n  +  H2/SHH)  (4.11) 

Vt(?t)  -  c2(1/n  +  (H-H0)2/SHH) 

where  H  stands  for  H(t).  The  first  of  Equations  (4.11)  was  used  to 
evaluate  the  contour  shown  in  Figure  1. 

This  figure  displays,  for  a  particular  data  set  fit  to  Equation  (2.2), 
the  projection  onto  the  0  -  T  plane  of  a  surface  defined  by  Equation 

(4.6).  The  projection  shown  in  the  figure  does  not  represent  a  planar  cross 

section,  with  constant  a,  of  the  surface  defined  by  Equation  (4.6).  Rather, 
for  each  point  (8,t)  in  the  projection,  there  is  some  value  of  a  such  that 
Equation  (4.6)  is  satisfied. 

For  the  contour  displayed,  the  number  of  observations  n  *  12,  and 

F*  was  set  equal  to  9,  corresponding  to  a  nominal  confidence  level 

1  »n-p 

1  -  x  of  approximately  0.99  (for  n-p  ”  9  degrees  of  freedom).  The 
contour  was  evaluated  at  each  integer  t  satisfying  Equation  (4.8).  In 
addition,  the  endpoints  T  and  t+  were  found  in  closed  form  (that  is, 
without  iterative  or  grid  search)  once  the  integer  intervals  containing  these 
points  had  been  determined.  Similarly,  the  0-limits  0  and  0  were  found 
in  closed  form  after  the  extreme  values  for  integer  t,  and  the  corresponding 
integers,  had  been  identified.  The  procedures  used  to  compute  these  endpoints 
are  described  in  Goldberg  (1982,  Appendix  D). 
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Heating  Rate  £  (Th/cu-  Fd) 


Figure  X:  Sum-of -Squares  Contour  in  the  Plane 


Reference  Temperature  t  (°F) 


Based  on  a  fit  of  Equation  (2.2)  to  data  for  the  state  aggregate, 
August  1969  -  July  1970.  The  contour  is  defined  by  Equation  (4.6) 
with  Fj  g  ■  9  (ir  «  0.01).  The  abbreviations  are  Th  for  therms, 

cu  for  customer,  and  d  for  day. 
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The  overall  shape  of  the  contour  displayed  in  Figure  1  is  roughly 
elliptical,  but  slightly  bent.  If  the  contour  had  been  evaluated  over  a 


continuous  range  of  T,  rather  than  just  at  integers,  it  would  still  have  the 
sane  basic  shape,  with  cusps  or  kinks  at  integer  values  of  T,  where  the 
function  RSS  has  discontinuous  derivatives.  The  segments  between  integers, 
however,  would  be  curves,  not  straight  lines. 

4.3.  Shortcut  Approximations  to  the  Sum-of -Squares  Contour 

Even  with  the  simplification  represented  by  Equation  (4.9),  computing  the 
sum-of-equares  intervals  can  be  rather  involved,  as  indicated  above.  To 
compute  these  intervals  accurately,  a  very  fine  grid  must  be  used,  or 
additional  steps  must  be  taken  to  pinpoint  the  limits  once  they  have  been 
roughly  identified.  It  is  tempting,  therefore,  to  look  for  shortcut  methods 
to  approximate  a  sum-of -squares  region.  The  Gaussian  approximation  (G)  of 
this  region  by  an  ellipsoid  is  one  such  shortcut.  Before  examining  the 
accuracy  of  this  approximation  for  the  energy  model  (Section  6)  we  introduce 
three  other  shortcuts,  obtained  by  evaluating  points  on  the  actual  sum-of- 
squares  contour. 

Figure  1  shows  the  confidence  intervals  [0  ,0  1  and  [t  ,t  ]  as  the 
projections  of  the  sum-of-squares  contour  onto  the  corresponding  axes.  The 
figure  also  indicates  how  two  of  the  shortcuts  to  be  considered  correspond 
directly  to  points  on  the  contour. 

First  is  the  "fixed-T"  shortcut  (F) s  confidence  intervals  for  all 


parameters  except  t  are  calculated  as  if  t  had  been  fixed  in  advance.  For 
general  models  of  the  form  (1.1),  the  fixed-tau  limits  X^  and  are  of 

A  A 

the  form  of  Equation  (4.3),  but  with  VT(X),  as  given  by  Equation  (4.10)  with 

A  * 

r  ■  t,  used  to  estimate  the  variance  of  X.  These  limits  are  the  points 


X^T  given  by  Equation  (4.9)  for 


T,and  correspond  to  the  cross-section  of 
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the  contour  at  T.  For  the  energy  model  example  of  Figure  1,  the  f ixed-tau 
int  i  al  is  roughly  half  as  wide  as  the  BU»-of- squares  Interval 

t0",0+]. 

The  second  shortcut  displayed  Is  the  "extreme— r*  shortcut  (E).  For  the 
general  case,  the  extreme-tau  limits  X”  and  X*  are  obtained  by 
evaluating  X^  at  the  extreme  values  t”  and  r+.  implicit  in  this 
procedure  is  the  assumption  that  X^  could  be  determined  perfectly  if  T 

A 

were  known.  In  terms  of  the  sum-of-squares  contour,  the  points  (t~,X  )  and 

+  *  T 

(t  ,X  +)  are  the  extreme  points  in  the  t  direction.  In  Figure  1,  the 

T 

extreme-x  confidence  interval  [0",B  1  understates  the  width  of  the  sum-of- 

E  E 

squares  interval  by  only  about  15%. 

A 

The  fixed-tau  interval  (F)  indicates  the  uncertainty  the  estimate  6 
would  have  even  if  t  were  known,  while  the  extreme-tau  interval  (E)  shows 

A 

the  uncertainty  in  8  attributable  to  uncertainty  in  r.  The  correct 
confidence  interval  must  combine  these  two  sources  of  error.  As  will  be  seen 
in  the  next  section,  the  two  simple  shortcuts  can  be  combined  in  a  straight¬ 
forward  way  to  give  a  confidence  interval  quite  close  to  that  obtained  by  the 
complete  sum-of-squares  method.  This  composite  shortcut  will  be  shown  in 
Section  6  to  perform  extremely  well  for  all  the  conditionally  linear 
parameters  a,  8,  and  T  of  the  energy  model. 

5.  The  Composite  Shortcut 

If  the  energy  model  function  n  were  linear  in  all  parameters,  the 
contour  shown  in  Figure  1  would  be  a  perfect  ellipse.  The  equation  for  this 
ellipse  is  conveniently  expressed  as 

(~)2  +  (|-)2  -  2p(|-)(2-)  -  1-p2  ,  (5.1, 

0  z0  *0  zo 


where 


X  "  T-T 


z  -  0-6 

*0  ■  V,rl*>,w 

A  A 

p  ■  Corr(0,T)  . 

For  the  general  model  of  the  form  (1.1),  the  parameter  6  can  be  understood 
to  represent  any  conditionally  linear  parameter  X^. 

An  arbitrary  ellipse  centered  at  the  origin  can  be  written  in  the  form  of 
Equation  (5. 1),  with  Xq  and  ZQ  positive.  Since  the  equation  has  no  real 
solution  (X,z)  for  |X|  >  |xQ|  or  |z|  >  |ZQ|,  the  values  ±*0  and  ±ZQ 
respectively  represent  the  extreme  values  of  X  and  z  on  the  ellipse. 

Now  assume  that  we  know  two  points  on  the  ellipse.  Once  is  (XQ,ZB), 
the  single  point  satisfying  Equation  (5.1)  at  the  extreme  value  X  -  Xq.  The 
other  is  ( 0 , ZF ) ,  one  of  the  two  points  satisfying  the  equation  for  X  -  0, 
Solving  the  equation  for  Z£  and  Z^,  we  find 


ZE  "  PZ0 


(5.2) 


As  a  result,  we  have 


fyf-'-*2  • 


2  2  2 
Z0  *  ZE  +  ZF  * 


(5.3) 


(5.4) 


Next,  assume  X  and  Z  are  random  variables  following  a  bivariate 

2  2 

normal  distribution  with  mean  sero,  variances  (kXQ)  and  (kZQ ) 
respectively  (for  some  constant  k)  and  correlation  p.  Xt  is  a  straight¬ 
forward  exercise  to  show  that  the  conditional  mean  and  variance  of  Z  are 


given  by 


e(z|x)  -  p  r-  (zft) 
*0  u 


(5.5) 


E((Z-E(Z|X))2|X]  -  (1-p2)(kZ0)2  . 


(5.6) 


I 


Comparing  Equations  (5.S)  and  (5.6)  with  Equations  (5.2)  and  (5.3),  we  sas 
2  2 

that  (kZg)  and  (kZp)  respectively  raprasant  tha  varianca  of  tha 
conditional  expactation  of  Z  givan  X,  and  tha  axpactation  of  tha 
conditional  varianca  of  Z  givan  X. 

Equation  (5.4)  is  thus  a  rasta tenant  of  tha  classical  decomposition  of 


varianca 


Var(Z)  -  Varx(B(Z|X))  +  Ex(Var(Z|X))  . 


(5.7) 


Equation  (5.7)  may  be  found,  for  example,  in  Rao  (1979,  p.  97).  This 


decomposition  applies  to  general,  jointly  distributed  randan  variables.  Tha 

2 

assumption  of  normality  yields  tha  components  in  proportion  to  p  and 

1-P2. 

Finally,  we  return  to  our  original  interpretation  of  X  and  z  as 

A  A  A  A 

x  -  t  and  5  -  0,  respectively,  for  a  givan  pair  of  estimates  (t,8).  with 
tha  modal  function  n  still  assumed  linear.  Equations  (5.2)  -  (5.4)  than 


became 


♦  *2  ?  +  '  2  +*2 
(8  -0)  -  P*(0  -0)  ♦  (1-P  )(0  -0) 

-  (B*-B)  ♦  (Bp-B)  , 


(5.8) 


and  similarly  for  B  • 

The  ellipse  defined  by  Equation  (5.1)  now  represents  a  confidence  region 
for  the  unknown  parameters  B  and  t,  rather  than  a  prediction  region  for 
two  random  variables  with  sero  means.  Nevertheless,  Equation  (5.8)  can  still 
be  interpreted  in  terms  of  the  decomposition  given  by  Equation  (5.7).  A 
similar  decomposition,  given  below,  relates  more  directly  to  the  confidence 
interval  for  B • 

First,  corresponding  to  the  conditional  expectation  E(Z|X),  the  least* 
squares  estimate  of  B  at  a  particular  assumed  value  t  can  be  expressed  as 


;  +  p  ill  (t-t)  . 

/  Var  (T) 


(5.9) 
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As  a  result. 


A  A  A  A  A  a  A 

Var(8T-0)  5  e<$t  -  0)  -  p  Var(8>  .  (5.10) 

If  8  could  be  determined  perfectly  if  T  were  known,  as  assumed  by  the 

extresie-tau  method  (E),  then  the  left-hand  side  of  Equation  (5.10)  would 

*  2 

represent  the  variance  of  8,  and  we  would  have  p  •  1.  Thus,  as  indicated 

earlier  by  Equation  (5.2),  the  extreme-tau  method  amounts  to  understating 

*  2 
Var(B)  by  a  factor  of  P  • 

For  the  second  component  of  the  variance,  corresponding  to  the 

A 

conditional  variance  Var(Z|X),  consider  the  variance  of  8T,  the  estimate 
of  8  when  t  is  known  (or  fixed  in  advance).  For  a  linear  model  with 
uncorrelated,  constant  variance  errors,  this  variance  is  related  to  the 

A 

unknown-t  variance  Var(B)  by 

Var(8T>  -  (1-p2 )Var(8)  .  (5.11) 

For  the  energy  model,  for  example,  in  the  case  of  known  T  we  have 

Var(ST)  *  o2/SHH  , 

whereas  when  t  is  estimated  the  linear  approximation  gives 

Var(8 )  -  02/SHH(1-p2)  . 

For  the  general  case.  Equations  (5.10)  and  (5.11)  together  give  a 
decomposition  analogous  to  Equation  (5.7).  Assuming  t  to  be  the  true  value 
of  that  parameter, 

Var ( 8 )  -  Var(8T-8)  ♦  Var(8T>  .  (5.12) 

This  decomposition  coincides  with  that  given  by  Equation  (5.7)  when  the 

A  A 

estimates  t  and  8  follow  a  normal  distribution. 

Equation  (5.12)  can  now  be  understood  as  the  formal  expression  of  an 

+  * 
earlier  statement.  The  extreme-tau  limit  8.  represents  the  error  in  8 

s 

associated  with  error  in  T,  as  given  by  Equation  (5.10).  The  fixed-tau 


+ 

limit  8^  represents  the  error  in  8^  for  a  given  T,  as  in  Equation 
(5.11).  Equation  (5.12)  shows  how  the  true  limit  8+  combines  these  two 
sources  of  error,  for  a  linear  model  function. 

For  the  nonlinear  model  functions,  the  composite  shortcut  (C)  given  by 
Equation  (5.8)  offers  a  simple  approximation  to  8  •  One  indication  that  this 
approximation  should  be  good  for  the  parameters  of  the  energy  model  is  the 
fact  that,  for  the  data  set  of  Figure  1,  the  shortcut  limits  8_  and  8„  are 
quite  close  to  those  indicated  by  Equations  (5.2)  and  (5.3).  The  correlation 

A  A 

p  between  8  and  t,  based  on  Equation  (4.5)  for  the  Gaussian  approxi¬ 
mation,  is  -0.83  for  this  data  set.  Thus,  Equations  (5.2)  and  (5.3)  imply 
that  the  extreme-tau  (E)  and  fixed-tau  (F)  shortcuts  should  understate  the 
sum-of-squares  interval  lengths  (S)  by  17%  and  44%,  respectively,  as  compared 
with  the  actual  understatements  of  15%  and  45%. 

As  will  be  seen  in  the  next  section,  the  composite  shortcut  is  indeed 
excellent  for  the  parameters  a,  8,  and  T  of  the  energy  model.  After 
examining  the  performance  of  this  shortcut  for  our  motivating  model,  we  will 
turn  in  Section  7  to  considerations  for  general  models  of  the  form  (1.1),  with 
an  arbitrary  parameter  taking  the  place  of  8. 

6.  Application  of  Approximation  Methods  to  the  Energy  Model 

Figure  2  compares  confidence  intervals  obtained  by  the  sum-of-squares 

method  (S)  and  by  the  different  approximations  for  the  data  set  of  our  earlier 

example.  For  each  parameter  a,  8,  T,  and  t  of  the  energy  model,  the 

different  upper  and  lower  confidence  limits  are  plotted  as  functions  of  the 

w/2 

nominal  confidence  level,  or  equivalently,  of  t  9  .  For  the  reference 
temperature  t,  only  the  sum-of-squares  (S)  and  Gaussian  (G)  methods  apply. 
The  sum-of-squares  (S),  extreme-tau  (E),  and  composite  (C)  limits  were 
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Figure  2.  Confidence  Intervale  by  Different  Approximation  Methods 

3-ir :  Confidence  Level  1-w  :  Confidence  Level 


7.8.9.95.96.99  .999 


t*/2  ,  t-eelue  tVZ  •  t— value 

9  9 


The  figure  shove  confidence  intervals  for  the  energy  nodal  parameters  obtained 
by  the  Gaussian  (G),  Sun-of-Squaree  (S ) ,  Fixed- t  ( F) ,  Extreme-T  (K ) ,  and 
Composite  (C)  methods.  The  nominal  confidence  level  is  indicated  at  the  top 
and  the  corresponding  t- value  at  the  bottom  of  each  graph.  See  caption  to 


Figure  1 


computed  only  for  integer  t- values.  Since  the  Gaussian  (G)  and  fixed-tau  (F) 
lisdLts  are  linear  in  the  t- value,  these  limits  are  shown  exactly  over  the 


range  displayed. 

6.1.  Comparisons  for  Baseload,  Heating  Bate,  and  Reference  Temperature 

For  the  intercept  o,  slope  0,  and  reference  temperature  t.  Figure  2 

shows  the  Gaussian  and  sum-of-squarea  limits  in  fairly  close  agreement  for 

lower  confidence  levels,  though  they  diverge  at  higher  confidence  levels.  For 

data  sets  like  this  one,  then,  we  might  be  satisfied  with  the  Gaussian 

approximation  for  any  confidence  level  of  practical  interest. 

However,  in  applications  of  the  energy  model  where  the  standard  errors  of 

the  estimates  are  relatively  large  compared  to  those  seen  here,  these  two  sets 

of  limits  will  tend  to  diverge  earlier,  in  particular,  for  data  sets  with  a 

“2 

high  sample  variance  o  or  with  too  few  observations,  the  Gaussian 
approximation  may  be  unreliable.  Such  data  sets  have  been  found  occasionally 
in  single-house  analyses. 

When  the  Gaussian  approximation  (G)  is  suspect,  the  composite  method  (C) 

offers  a  convenient  Improvement  for  the  linear  parameters .  For  both  a  and 

0,  the  composite  limits  follow  the  sum-of-squares  limits  (S)  quite  closely, 

w/2 

only  a  slight  discrepancy  being  seen  for  t?  -  5,  corresponding  to  a 
confidence  level  greater  than  0.999.  For  a  as  well  as  for  0,  the  two 
components  (B)  and  (F)  of  the  composite  approximations  are  of  roughly  equal 
magnitude,  so  that  neither  alone  is  adequate. 

6.2.  Approximations  for  normalized  Annual  Consumption 

A  different  pattern  is  seen  for  the  index  T,  of  major  interest  in 
energy  analyses.  The  curves  for  the  Gaussian  (G),  fixed-tau  (F),  and 
composite  (C)  approximations  are  Indistinguishable  from  those  for  the  sum-of- 


k 


-21 


square*  limits  (S)  in  Figure  2.  The  reason  the  four  methods  coincide  is  that 

A 

the  index  I*  is  quite  insensitive  to  the  nonlinear  parameter  t,  as 
indicated  by  the  narrow  bands  of  the  extreme-tau  (B)  approximation*  For  this 
parameter  then,  it  is  reasonable  to  ignore  the  estimation  error  associated 

A 

with  error  in  t,  relying  simply  on  the  fixed-tau  method  (F).  The  composite 
(C)  and  Gaussian  (G)  approximations  represent  negligible  corrections  over  the 
fixed-tau  method  alone.  Because  the  accuracy  of  the  fixed-tau  method  is  quite 
useful  in  practice,  we  consider  in  more  detail  what  circumstances  make  this 
method  reliable. 

A 

Equations  (5.2)  and  (5.3)  indicate  that  more  of  the  variance  of  T  will 
be  associated  with  the  extreme-tau  limits,  and  less  with  the  fixed-tau  limits, 

A  A 

when  the  covariance  p  between  T  and  T  is  large.  The  magnitude  of  this 

AAA 

covariance  (C(T,t)  in  Equation  (4.5))  tends  to  be  larger  for  larger  values 

A 

of  AH  and  AF.  The  general  insensitivity  of  to  T  stems  from  the  fact 

that  normal  degree-days  HQ (r )  and  its  derivative  Fq(t)  are  usually  close 
to  the  corresponding  sample  means  H(t)  and  F(t),  so  that  the  terms  AH 
and  Af  are  small. 

A 

However,  the  insensitivity  of  to  T  may  not  hold  if  the 

observations  Ym  are  not  taken  over  an  integral  number  of  years,  or  are  taken 
at  unevenly  spaced  times.  In  such  cases,  the  differences  AH  and  AF 
between  the  base-period  values  and  the  sample  means  are  likely  to  be  large. 
Such  irregularly  occurring  data  are  common,  for  example,  for  fuel  oil 
deliveries,  which  are  more  frequent  during  cold  months.  As  a  result,  for  some 

A  A 

oil-heated  houses  the  correlation  p  between  T  and  t  is  occasionally  too 

high  for  the  fixed-tau  (F)  approximation  to  be  trusted. 

For  the  example  data  set  of  Figure  2,  AH  “  1.3°F  and  AF  “  0.012, 

A  A 

while  the  correlation  p  between  T  and  T  is  0.07.  The  situation  is 

quite  different  for  a  particular  oil-heated  house  in  the  same  region.  For 
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that  house,  all  but  one  of  the  nine  observations  represented  winter 


consumption?  the  corresponding  values  are  AH  *  7.6°F,  AF  ”  0.426  and  p  “ 
0.6.  In  this  case,  for  ir  >0.05  the  fixed-tau  limits  (F)  understate  the 
sum-of- squares  limits  (S)  by  20%  (as  compared  with  a  negligible  amount  for  the 
data  set  in  Figure  2).  By  contrast,  the  Gaussian  interval  of  [434,938] 
(gallons  per  year)  is  quite  close  to  the  sum-of-squares  interval  [412,927]. 

The  composite  interval  is  even  closer:  [423,923]. 

Thus,  the  same  relationship  that  was  found  for  the  parameters  a  and  0 
of  the  ordinary  aggregate  gas  data  set  displayed  in  Figure  2  holds  also  for 
the  index  T  even  in  a  case  where  the  fixed- tau  approximation  (F)  is 
unreliable.  That  is,  the  Gaussian  approximation  (G)  seems  to  be  adequate  for 
confidence  levels  of  practical  interest,  but  the  composite  shortcut  (C)  is 
still  more  accurate.  In  the  next  section  we  will  discuss  why  the  composite 
should  be  superior  in  general  for  models  of  the  form  (1.1). 

6.3.  Implications  for  Other  Biergy  Data  Sets 

A  total  of  75  aggregate  gas  data  sets  were  fit  by  Equation  (2.2),  with 
the  degree-day  variable  defined  by  Equation  (2.3).  Although  they  were  not 
studied  in  the  same  detail  as  the  data  set  used  for  Figures  1  and  2,  our 
analysis  indicates  that  the  rest  of  these  data  sets  should  behave  similarly. 
For  one  thing,  the  (Gaussian  approximation)  correlations  among  the  parameter 

AAA  A 

estimates  a,  0,  t,  and  T  were  similar  for  all  the  data  sets  studied. 

Secondly,  as  discussed  elsewhere  (Goldberg,  1983),  the  example  data  set  seems 

to  be  typical  in  terms  of  the  severity  of  nonlinearities  which  might  distort 

the  sum-of-squares  contour  from  an  ellipse.  From  the  previous  section,  we 

know  that  the  success  of  the  extreme-tau  (E)  and  fixed-tau  (F)  shortcuts 

depends  on  the  magnitude  of  the  correlation  p,  while  the  success  of  the 

composite  shortcut  (C)  depends  on  how  nearly  elliptical  the  sum-of-squares 
contour  is. 
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7.  Advantages  of  the  Composite  Shortcut 

In  applying  the  approximation  represented  by  Equation  (5.3)  to  a  general 
model  of  the  form  (1.1),  two  considerations  are  important:  ease  of  computa¬ 
tion,  and  accuracy.  As  noted  above,  we  here  measure  accuracy  in  terms  of  how 
closely  the  composite  limits  (C)  approximate  the  sum-of-squares  limits  (S). 

Computation  of  the  composite  limits  (C)  is  considerably  easier  than 

A 

finding  the  sum- of- squares  limits  (S).  Once  the  least  squares  estimate  6 
has  been  found,  a  nonlinear  equation  must  be  solved  only  to  obtain  the  limits 
T  and  t+  for  the  nonlinear  parameter  T.  For  the  linear  parameters  , 
the  extreme-tau  limits  (E)  are  then  easily  obtained  as  the  coefficients  from  a 
linear  regression  at  x+  and  t  ,  and  the  fixed-tau  limits  (F)  as  confidence 

A 

interval  bounds  for  a  linear  regression  at  t. 

If  the  model  has  been  fit  using  the  derivatives  n,  as  with  the  commonly 
used  procedures  based  on  Newton's  method,  the  composite  shortcut  (C)  will 
require  more  computational  effort  than  the  Gaussian  approximation  (G). 

However,  for  some  types  of  models,  particularly  non-differentiable  '<s»s, 
derivative- based  fitting  procedures  may  be  impractical.  The  complexity  such 
methods  can  impose  was  indicated  in  Section  3.  Even  for  continuously 
differentiable  model  functions  n,  it  may  be  easier,  and  more  informative, 
for  an  analyst  to  run  a  series  of  linear  regressions,  varying  the  parameter 
t,  than  to  implement  a  nonlinear  regression  routine  correctly  for  his 
model.  Provided  a  wide  enough  range  of  values  of  T  was  used  initially,  the 
composite  shortcut  (C)  can  be  computed  directly  from  the  output  of  such  a 
series  of  linear  regressions. 

Whatever  the  fitting  procedure  used,  both  the  Gaussian  (G)  and  fixed-tau 
(F)  methods  offer  the  computational  advantage  of  giving  limits  for  any 
confidence  interval  of  interest  by  multiplying  a  single  standard  error 
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estimate  by  the  appropriate  t-value.  Further,  the  latter  method  (F)  does  not 
require  the  derivative  of  the  model  function  n  with  respect  to  the  nonlinear 
parameter  t.  However,  the  linear,  t-based  confidence  intervals  (F)  and  (G) 
cannot  reflect  any  asymmetry  in  the  likelihood  (8)  region.  As  can  be  seen  in 
Figure  2  of  the  previous  section,  this  asymmetry  is  captured  by  the  extreme- 
tau  method  (E),  and  is  appropriately  incorporated  in  the  composite  (C). 

For  the  energy  model,  the  composite  approximation  (C)  was  seen  in  the 
previous  section  to  be  quite  accurate,  and  superior  to  the  Gaussian 
approximation  (G).  The  difficulty  with  the  Gaussian  approximation  in  general 
is  that  any  nonlinearity  in  the  model  function  will  make  the  Gaussian  ellipse 
different  from  the  sum-of -squares  contour,  even  though  many  types  of 
nonlinearity  will  leave  the  sum-of- squares  contour  itself  essentially 
elliptical. 

Hamilton,  et  al  (1981)  presented,  as  an  improvement  over  the  Gaussian,  an 
explicit  elliptical  approximation  to  the  sum-of-squares  regions,  based  on  a 
second-order  expansion  of  the  model  function  n.  Their  method  applies  to 
general  continuously  differentiable  models,  but  requires  that  the  model 
function  have  no  "parameter-effect"  nonlinearities.  As  shown  in  related  work 
by  Bates  and  Watts  (1980  and  1981),  this  requirement  is  not  often  met  by 
models  under  a  natural  parameterization,  nor  is  it  easily  achieved  by 
reparameterization  for  most  models. 

The  composite  method  is  accurate  so  long  as  the  sum-of-squares  region  is 

roughly  elliptical,  even  if  the  ellipse  does  not  coincide  with  any  explicit 

approximation  we  might  consider.  Further,  since  the  composite  method 

* 

approximates  the  two  halves  of  the  sum-of-squares  region  (for  t  >  t  and  for 

A 

t  <  t)  separately,  the  procedure  can  perform  well  even  in  the  presence  of 
substantial  asymmetry,  unlike  any  single  elliptical  approximation.  Finally, 
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an  important  consideration  in  the  present  study,  the  composite  method  can  be 
applied  to  both  continuously  and  piecewise  differentiable  models*  Extensions 
of  the  method  to  accommodate  joint  confidence  regions  or  models  with  more  than 
one  nonlinear  parameter  are  also  possible;  the  practicality  of  such  procedures 
remains  to  be  investigated. 
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estimate  by  the  appropriate  t- value.  Further,  the  latter  method  (F)  does  not 
require  the  derivative  of  the  model  function  n  with  respect  to  the  nonlinear 
parameter  T.  However,  the  linear,  t-based  confidence  intervals  (F)  and  (G) 
cannot  reflect  any  asymmetry  in  the  likelihood  (S)  region.  As  can  be  seen  in 
Figure  2  of  the  previous  section,  this  asymmetry  is  captured  by  the  extreme- 
tau  method  (E),  and  is  appropriately  incorporated  in  the  composite  (C). 

For  the  energy  model,  the  composite  approximation  (c)  was  seen  in  the 
previous  section  to  be  quite  accurate,  and  superior  to  the  Gaussian 
approximation  (G).  The  difficulty  with  the  Gaussian  approximation  in  general 
is  that  any  nonlinearity  in  the  model  function  will  make  the  Gaussian  ellipse 
different  from  the  sum-of-squares  contour,  even  though  many  types  of 
nonlinearity  will  leave  the  sum-of-squares  contour  itself  essentially 
elliptical. 

Hamilton,  et  al  (1981)  presented,  as  an  improvement  over  the  Gaussian,  an 
explicit  elliptical  approximation  to  the  sum-of-squares  regions,  based  on  a 
second-order  expansion  of  the  model  function  n.  Their  method  applies  to 
general  continuously  differentiable  models,  but  requires  that  the  model 
function  have  no  "parameter-effect"  nonlinearities.  As  shown  in  related  work 
by  Bates  and  Watts  (1980  and  1981),  this  requirement  is  not  often  met  by 
models  under  a  natural  parameterization,  nor  is  it  easily  achieved  by 
reparameterization  for  most  models. 

The  composite  method  is  accurate  so  long  as  the  sum-of-squares  region  is 
roughly  elliptical,  even  if  the  ellipse  does  not  coincide  with  any  explicit 
approximation  we  might  consider.  Further,  since  the  composite  method 

A 

approximates  the  two  halves  of  the  sum-of-squares  region  (for  t  >  T  and  for 

A 

t  <  t)  separately,  the  procedure  can  perform  well  even  in  the  presence  of 
substantial  asymmetry,  unlike  any  single  elliptical  approximation.  Finally, 
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an  important  consideration  in  tha  present  study,  the  composite  method  can  be 

i 

applied  to  both  continuously  and  piecewise  differentiable  models.  Extensions 
of  the  method  to  accoemodate  joint  confidence  regions  or  models  with  more  than 
one  nonlinear  parameter  are  also  possible*  the  practicality  of  such  procedures 
remains  to  be  investigated. 


APPENDIX 


Following  is  the  procedure  used  to  fit  Equation  (2.2).  For  more  details, 

see  Stram,  et  al  (1982).  The  terms  and  T^n  respectively  denote  the 

highest  and  lowest  temperatures  Tmj  in  the  data  set. 

(0) 

1 )  Choose  a  starting  value  t  ,  a  maximum  number  IMAX  of  iterations  i , 

and  checking  ranges  N1  and  N2.  (Recommended  values  are  ■  65°f, 

IMAX  -  10,  N1  »  4,  N2  -  10.)  Set  i  -  0. 

Iterative  search  by  Newton’s  method 

2)  Find  from  Equation  (3.4). 

A 

3)  Find  T  by  one  of  the  following,  and  choose  the  corresponding  checking 
range  NCHECK : 


a) 

If 

i  >  IMAX,  set 

*  (i) 

T  =  (NCHECK”  N2). 

b) 

If 

T(1+1>  >  T 

max 

or  T<i+1)  <  T  ,  set  T  ”  T{i> 

un 

(NCHECK  «  N2). 

c) 

If 

T(i+1)  a  TU-1 ) 

*  (i) 

,  set  t  «  integer  between  T 

and  x<i+1> 

(NCHECK  *  N1). 

d) 

If 

_(i+1)  _  _(i) 

1  “  T  f 

set  T  ”  T(1)  (NCHECK  -  N1). 

e) 

Otherwise,  replace 

i  by  i  +  1  and  return  to  step 

2. 

Local  grid  search 

A  A 

4)  Set:  k  ”  integer  part  of  the  current  estimate  T > 

kmin  *  max(Tmin'  *-NCHECK>  * 

W  “  min<TmaX,  ™ECK). 

5)  a)  Repeat  for  k  -  to  k,,,^  then  go  to  b): 

A  A  A 

If  RSS(k)  <  RSS(t),  replace  T  by  k  and  replace  k  by  k. 

A  A 

b)  If  T  -  k  .  or  T  ■  k _  continue  to  c). 

min  max 

Otherwise  go  to  d). 
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c)  ( For  the  rare  cases  whan  3a)  or  3b)  gives  an  initial  valua  t  close 

to  Tain  or  *aax*> 

*  * 

For  k  »  T  -  k^n  ( kMtt ) ,  rasat  k,,^  (k,^)  as  in  Stap  4.  If  tha 
"resat"  aakes  no  change,  go  to  d).  Otherwise,  return  to  a)  for  k  - 

Sain  to  k  (k  to  k««)* 

A  A 

d)  For  k  “  k-1  and  k  ■  ki 

*  (i+1) 

Find  as  T  frost  Equation  (3.4),  evaluated  for 

A  AAA 

If  e  (k,k+1)  and  RSS^)  <  rss(t),  replace  t  by 

6)  Cosiputatlon  of  reaaininq  parameters 
Calculate 

A 

0  -  (SHY/SHH) 

|T 

»  *_  * 
o  -  Y  -  0H(T ) 

A  A  A  A 

T  -  a  ♦  $Hq(t) 

A  procedure  described  by  Hinkley  (1971)  for  fitting  a  very  siaiilar  change 
point  aiodel  is  equivalent  to  starting  at  step  (5),  and  letting  k  range 
from  Tmin  ♦  1  to  Tb#j(  -  1.  The  full  procedure  given  here  greatly  reduces 
the  total  nuaber  of  function  evaluations  required,  at  the  risk  of  settling  on 
a  local  ainiaizer  which  is  not  ths  global  ainiaiser  of  RSS(T).  The  values  Ml 
and  N2  have  been  chosen  to  reduce  this  risk  to  a  very  low  level  (see  Goldberg, 
1982). 


-28- 


REFERENCES 


Bates,  D.  M.  and  Watts,  0.  G.  (1980).  "Relative  Curvature  Measures  of 
Nonlinearity."  J.R.S.S.  B  42  :1-25. 

_  (1981).  "Parameter  Transformations  for  Improved 

Approximate  Confidence  Regions  in  Nonlinear  Least  Squares."  Annals  of 
Statistics  9  :  1152-1167. 

Dutt,  G.  8.,  Imvine,  M. ,  Levi,  B.,  and  Socolow,  R.  (1982).  "The  Modular 
Retrofit  Experiment:  Exploring  the  House  Doctor  Concept."  Princeton 
University  Center  for  fiivironmental  Studies,  Princeton,  NJ,  Rept.  No. 

130. 

Pels,  M.  F.  and  Goldberg,  M.  L.  (1982).  "Measuring  Household  Fuel  Consumption 
on  the  Standard  Living  Cycle."  Energy  7  :  489-504. 

Fels,  M.  F.,  Goldberg,  M.  L. ,  Levine,  M.  L. ,  So colow,  R.  H. ,  and  Abrams,  P.  S. 
(1982).  “Exploratory  Analysis  of  Oil-Heated  Houses."  Princeton 
University  Center  for  Energy  and  Bivironmental  Studies,  Princeton,  NJ, 
Report  No.  139. 

Goldberg,  M.  L.  (1982).  "A  Geometrical  Approach  to  Nondifferentiable 

Regression  Models  as  Related  to  Methods  for  Assessing  Residential  Energy 
Conservation."  Princeton  Uhiversity  Center  for  Energy  and  Environmental 
Studies,  Princeton,  NJ,  Rept.  No.  142;  Ri.D.  dissertation  for  the 
Department  of  Statistics. 

______________  (1983).  "Measures  of  Nonlinearity  for  Segmented  Regression 

Models."  University  of  Wisconsin-Madison,  Mathematics  Research  Center, 
Technical  Summary  Report  #2549. 

Hamilton,  D.  C.,  Watts,  D.  G. ,  and  Bates,  D.  M.  (1982).  "Accounting  for 
Intrinsic  Nonlinearity  in  Nonlinear  Regression  Parameter  Inference 
Regions.”  Annals  of  Statistics  10  :  386-393. 


national  Oceanic  and  Atnoapharic  Administration,  U.  S.  Department  of  Commerce 
(monthly  publication).  "Local  Climatological  Data,  Monthly  Summary , 
Newark,  New  Jersey.” 

St ram,  D. ,  Dutt,  G.  S.,  Pels,  M.  F.,  and  Goldberg,  M.  L.  (1982).  "The  Score- 
keeping  Model  for  Residential  Energy  Consumption:  Procedures  and 
Problems."  Princeton  University  Center  for  Energy  and  Environmental 
Studies,  Princeton,  NJ,  Rapt.  No.  147. 


MLG/PB/Mff/jva 


SECURITY  CLASSIFICATION  of  This  RAGE  fWN««  D«»  Bntarag) 


REPORT  DOCUMENTATION  PAGE 


REPORT  NUMBER 

#2601 


A.  TITLE  fan 4  SuMilaJ 


BEAD  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


SIMPLIFIED  CONFIDENCE  INTERVALS  FOR  AN 
AWKWARD  NONLINEAR  MODEL 


7.  AUTHOR/ 

Miriam  L.  Goldberg,  Peter  Bloomfield 
and  Margaret  F.  Fels 


».  PERFORMING  ORGANIZATION  NAME  ANO  ADDRESS 

Mathematics  Research  Center,  University  of 
610  Walnut  Street  Wisconsin 

adison,  Wisconsin  53706 


IO 


•  I.  CONTROLLING  OFFICE  NAME  ANO  ADDRESS 


See  Item  18  below 


■ 

i 


s  type  of  report  a  period  covered 
Summary  Report  -  no  specific 
reporting  period 


S.  PERFORMING  ORG.  REPORT  NUMBER 


I-  CONTRACT  OR  GRANT  NUMBER?*) 

MCS-7 927062,  Mod.  2 

DAAG29-80-C-0041 


to.  PROGRAM  element,  project,  task 

AREA  B  WORK  UNIT  NUMBERS 

Work  unit  Number  4  * 
Statistics  6  Probability 


tz.  report  oate 
November  1983 


IS.  NUMSER  OF  PAOES 

30 


MONITORING  AGENCY  NAME  B  AOORESSflf  <0  Ha  fan  I  tram  Can  trailing  Off  Ira)  I  IS.  SECURITY  CLASS.  fa>  CM  a  rayan) 


UNCLASSIFIED 


t«.  DISTRIBUTION  STATEMENT  fof  Ma  R apart; 

Approved  for  public  release;  distribution  unlimited. 


NEOULC 


ICATI  ON/ DOWNGRADING 


17.  DISTRIBUTION  STATEMENT  ( at  tha  abstract  antarag  m  Blank  iO,  II  WHwaal  tnm  Itapart) 


IF.  SUPPLEMENTARY  NOTES 

U.  S.  Army  Research  Office  and 

P.  O.  Box  12211 
Research  Triangle  Park 
North  Carolina  27709 


It.  KEY  WORDS  fConffnua  on  faaaraa  alda  II  macaatarr  M  MaattF  ky  alaak 


National  Science  Foundation 
Washington ,  DC  20550 


Nonlinear,  Regression,  Confidence  Intervals ,  Energy 


20.  ABSTRACT  (Cmntimm  «n  r«v«rM  elcfe  If  AtcifHir  m 4  If  roe*  m m**) 

A  simple  method  is  developed  for  approximating  confidence  intervals  for  a 
linear  change-point  model  representing  residential  energy  consumption.  The 
method  can  be  applied  to  any  nonlinear  model  which  is  conditionally  linear  in 
all  but  one  parameter .  The  approximation  is  shown  to  be  more  accurate  than  the 
Gaussian  approximation,  and  easier  to  compute  than  confidence  intervals  defined 
by  likelihood  (sum-of -squares )  regions.  Computations  are  described  in  some 
detail  for  the  motivating  model. 


EOITION  OF  t  NOV  BE  IS  OBSOLETE 


UNCLASSIFIED _ 

SECURITY  CLASSIFICATION  OF  THIS  PAOC  fWhMi  I 


